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SECTION  I 
INTRODUCTION 


A.  OVERVIEW 

Rapid  advances  have  been  made  during  tlie  past  several  years  in  large-scale  integrated  circuit 
(LSIO  technology.  These  advances  have  had  a  significant  impact  on  many  military  signal  processing 
t'unctions  found  in  such  applications  as  forward-looking  infrared  (I'Ll  R )  radar,  guidance  and  control, 
and  ECM  systems.  In  particular,  image  processing  system  studies  for  video  bandwidth  reduction. 
FLIR  automatic  cueing,  3-D  target  classification,  and  image  understanding  have  consistently  recom¬ 
mended  using  LSIC  technologies  to  perform  critical  image  processing  functions.  A  general-purpose 
algorithm'  which  uses  the  linear  operation 


.\i 


on  a  single  video  line  or  on  an  n  by  n  block  of  picture  elements  is  an  iiieal  candidate  to  be  imple¬ 
mented  with  LSIC  technologies. 

While  such  algorithms  can  be  e.xecuted  easily  at  low  data  rates  using  general-purpose  mini¬ 
computers  or  even  commercial  microprocessors,  it  is  usually  not  possible  to  execute  them  in  real 
time  in  an  airborne  environment  because  of  excessive  si/e.  vveiglit,  power  dissipation,  and  cost.  Ihe 
key  to  effective  system  design  is  to  apply  LSIC  technology  to  minimi/e  the  overall  component 
count  and  variety  of  components  while  absorbing  as  much  as  possible  of  the  control  and  timing 
logic  onto  the  information  processing  chip  themselves.  The  solution  is  optimum  when  the  same 
chips  can  be  used  for  a  multitude  of  other  applications  to  provide  a  high  volume  market.  These 
requirements  have  lead  to  the  desire  for  a  prograittmable  chip  architecture,  and  in  turn,  to  the  con¬ 
cept  of  a  parallel/serial  input  bus.  The  discovery  of  the  read-only  memory  (ROM)  accumulate'-'''' 
algorithm  to  implement  the  above-mentioned  linear  operator  could  have  significant  impact  on  image 
processing  systems. 

B.  OBJECTIVE 

The  objective  of  this  12-month  exploratory  development  effort  was  to  develop  a  general- 
purpose  digital  LSIC  device  called  the  programmable  image  processing  element  (PIPT  ).  which  can 
be  programmed  to  perform  a  variety  of  signal  processing  functions  such  as  Cosine  and  lladamard 
transforms,  edge  extraction,  unsharp  masking,  pole-zero  filtering,  and  signal  smoothing  on  8  b\  1 
or  3  by  3  element  data  blocks  at  full  TV  data  rates  of  10  megasamples  per  second.  Such  a  dev  ice 
should  find  widespread  application  in  anti-jam  video  data  links,  I  LIR  automatic  cuers,  target 
classifiers,  and  digital  filter  processors.  Tlence.  the  PIPl'  LSIC'  is  suited  for  those  airborne  applica¬ 
tions  where  data  rate,  size,  power  and  weight  restrictions  prohibit  the  use  of  general-purpose 
microprocessors  or  high-speed  digital  multipliers.  This  contract  will  address  only  the  design  and 
photomask  fabrication  of  the  PIPE  LSIC’. 

C.  SUMMARY 

During  the  design  pha.se  of  this  contract,  investigations  of  ROM  technologies  aiul  designs  to 
ensure  a  user-oriented  image  processing  LSIC  were  conducted.  The  results  of  this  investigation  are 


detailed  in  the  IMIM  l.SK  desiiin  discussion  given  in  Subsection  11. A.  A  brief  summary  of  the  PlPh 

I  SIC  follows. 

Texas  liistiumeiUs  was  particularly  well  i|ualified  to  execute  this  contract  successfully 
because  of  ils  low  risk  implementation  approacli.  This  low  risk  implementation  approach  is  a 
result  of: 

•  I  ser-oriented  PIPl.  1  SIC  architecture 

•  f.stahlished,  cost-effective  LSIC  technology 

•  Proven  erasable,  programmable  ROM  design 

•  Breadboard  emulator  of  PlPli  LSIC. 

Several  architectures  were  investigated  as  candidates  for  implementing  the  PIPL  LSIC.  The 
architecture  selected  m;iximi/es  flexibility  in  algorithm  implementation  and  minimi/.es  external 
control  and  timing  logic. 

I  he  N-channel  inctal-oxide-sctniconductor  (NMOS)  technology  was  selected  to  implement 
the  PIPl  LSIC  because  it  is  an  established,  cost-effective  technology  capable  of  providing  the  high- 
circuit  ilensity.  low  speevl-powcr  proiluct,  and  user  erasable  programmable  read-only  memory 

I I  PRO.Mi  neevlcil  lor  the  user-oriented  PIPL  LSIC  architecture. 

The  NMOS  erasable  programmable  read-only  memory  was  judgcil  to  be  the  best  technology’ 
to  meet  the  goals  of  the  PIPl’  LSIC.  Hie  advantages  of  NMOS  liPROM  technology  are; 

•  Liser-programmabic  telcctrically ) 

•  lirasable  (ultraviolet  light) 

•  Nonvolatile 

•  Single  supply  voltage  operation 

•  Static  on-chip  NMOS  logic 

•  Milit.iry  gravie 

•  l.stahlishevi  technology. 

lexas  Instruments  has  available  in  production  quantities  a  family  of  military -grade  LPROMs 
ranging  in  si/e  from  8K  (  I  K  =  1024  bits)  to  32K.  This,  along  with  the  features  of  LPROMs,  makes 
the  I  PROM  technology  ideal  for  the  PIPL  LSIC  program. 

1  he  PIPL  LSIC  ilesign  completed  during  this  contract  is  shown  in  Ligure  1.  As  illustrated  in 
this  figure,  there  are  five  major  sections  of  this  integrated  circuit : 

•  Input  latch  parallel-to-serial  shift  register 

•  LPROM 

•  Shift-and-accumulatc 

•  I  ri-statc  output  hitch 

•  Controller. 
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Figtire  I  PIPE  LSK  Livoui 


IvacU  of  tlK'sv;  Uoivtions  is  Uiscosscil  in  ilcUiil  in  Subscclions  11. A. 2  Ilirougli  ll.,\.(i,  rcspec- 
tivu'K.  I  ho  totiil  b;ir  si/o  is  appro.xinuitolx  240  In  2"0  mil-',  uilh  ;i  li'Ial  ostinialotl  power  of  oOO 
milliwatts. 

loxas  Instruments  believes  the  proposed  Phase  II  PIPl  proeiiiin  is  very  relevant  and  timely 
for  the  development  of  a  programmable  iimige  proeessing  element  :md  h;is  earelullv  eonsidered  the 
applieations  ot'  the  PIPE  LSK  ,  I  he  PIPE  ESK  is  ivleal  lor b\  ,>  window  operations  and  8  by  1  or 
'•  by  I  matrix  operations.  A  nune  iletailed  diseussion  of  the  ;ipphe;itions  of  the  PIPE.  LSIC  is  pre¬ 
sented  in  Subsection  lEB. 


SECTION  II 

TECHNICAL  DISCUSSION 


A.  PIPE  LSIC  DESIGN 

losiis  liislninu'iils  luis  (.k'sigiK'il  a  inouraninialilc  iiiiajic-iMoccssing-cIcment  (Pll'hi  larye-scak- 
inti'uratod  circuit  (LSIC)  durini:  Phase  I  of  the  PIPl'  projtrani.  contract  1  5'7‘)-C-l  TCc'.  IJetaiK 
ot'  tliis  design  are  discussed  in  this  section. 

I.  Introduction 

Man\  image-processing  algoritlims'  rei|iiire  a  sum  ol  products  operator  ot'  the  tdrm 


M 


1=1 


where  the  represents  a  set  ol'  I'ised  programmal''le  weighting  coet'l'icients,  \,  represents  a  set  ■') 
se(|uence  ot  input  \alues,  and  M  represents  the  number  ol  inputs.  This  mathematical  lunction  car, 
be  iiseil  to  calculate  the  coet'l'icients  of  various  transforms  used  in  mans  image-processing  applisa- 
tious  such  as  l  ourier.  Cosine,  liadamtird.  Ilaar.  and  others.  I'he  sum  of  products  expression  can 
also  be  used  in  determining  many  neighborhood  operators  which  perform  such  operations  as  noisf 
smoothing,  edge  enhancement,  and  edge  crispening.  For  many  image-processing  applications  such 
as  video  bandwidth  reduction,  forward-looking  infrared  (1  LIKi  aulocneing.  target  classification. 
;unl  image  understanding,  algorithms  based  on  the  sum  of  products  operator  are  reciuired. 

I  he  sum  of  products  operation  of  I  (.pialion  I  can  be  implemented  using  digital  integrated 
circuit  (K  I  multipliers  and  an  accumuhnor,  as  shown  in  Figure  2.  However,  the  si/e  and  power 
rec|uired  to  perform  the  multiplictitiims  with  digital  1C  imdtipliers  at  \  ideo  data  rates  are  prohibitiw 
for  mans  airborne  image-processing  applications.  Hence,  invesiigtilions  have  been  perbinncd 
recently  on  techni(|ues  for  the  reali/ation  of  the  sum  of  products  operation  w'ilhout  using  digital 
multipliers. I  liese  di>tributed  tirithmetic  techniques  are  a  table  look-up  procedure  to  perform 
the  imdtiplicatiou  in  I  quatiou  i .  I  his  table  look-up  operation  replaces  the  Lligilal  imdtipliers  with  a 
re.ui-onls -memors  (ROM)  function.  I  he  principle  of  operation  of  the  ROM-accumiilate  algorithms 
is  discussed  below . 

As  noted  in  I  t|uation  I.  the  weighting  coefficients,  are  fi.xed  and  known  while  the  ini'ut 
words.  \^.  are  variable.  In  bimirs  arithmetic,  the  input  variable.  X^.  can  be  expressed  as 
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B\  I  qua t ion  2  into  i‘,tiualion  1 ,  the  followiniz  expression  is  ubtaineil 
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hi;kkali.'d  iciin  ..111  !■  '  calciil.ttorl  lor  all 
possihk'  s  oniInnaUoiis  anil  sloivil  in  an  ROM. 
llk'ivloiv,  till'  I'aitisular  i'i|  from  tlic 
iiiioniini.'  ikil.i  I'.ni  he  uscil  to  address  the 
ROM  and  lelih  the  associated  value  of  the 
hraskeled  term.  Once  the  ('artieular  value  ol' 
the  hr.kketed  term  is  obtained,  h.iiuation  4 
can  be  eomi'Uled  In  shitline  this  braeketed 
ieim  !v,  one  bit  position  belore  aeeunuilatin;! 
the  I'livioiis  Slim.  A  block  dia.i^rain  ol’  this 
ROM-aceunuikite  .ilaorithm  is  shown  in 
l  ieiire  .v  Note  liiat  the  X,  terms  are  bit-serial 
due  to  the  rearr.m.eeiiient  of  terms  in 
I  i|uatinn  -I  I  he  si/e  ol  the  ROM  rei|uiretl 

III  hieiire  .'  is  2'“'  words,  where  M  is  the  inaMimiin  number  ol  input  \ allies,  as  gisen  in  h.iiuation  1. 
1  he  number  ol  bits  pel  word  in  the  memory  is  the  sum  of  the  word  length  oh  W,  and  the  value  ol 
loe .  M . 
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Figure  2.  Block  Diagram  of  the  Multiplier- 
Accumulator  Algorithm  for  Implementing  Sum 
of  Products  Operator 


As  noted  in  the  .ibove  ihsciission.  the  input  data  to  the  ROM  is  bit-serial,  lloweier.  in  most 
apphs.ilions.  the  input  d.it.i  in  a  typical  image-processing  application  is  given  m  a  parallel  word 
lormat.  lienee,  the  idock  diagram  ol  Figure  3  is  moilil'ied  to  change  the  input  data  hrom  parallel 
lorm.it  to  bit-serial  hormat.  as  shown  m  I'igure  4.  I'his  input  data  can  be  ret'ormatted  with  a  parallel- 
to-SL  i  ial  slut  '  register  ciivint  Also  shown  in  I'igure  4  is  an  output  buhler.  which  buhleis  the  outputs 
ol  till'  hil  l-and-accumulale  lunction.  and  a  controller  lunction.  which  prov  ides  the  iiecessarv 
soiitrols  to  the  other  liiiietions.  ,.\  brief  discussion  ol  the  design  ol  the  PlIM.  I  Sit  lollows, 

I  he  lollowmg  design  goals  were  used  in  the  ilevelopment  ol  the  I’ll’l  l.SK'.  1  he  IMI’I 
1  Sl(  will  .lecepl  input  d.ita  being  '-IS  bits  at  a  20-Mll/  data  rate.  Data  entry  can  be  in  3  >  S  blocks 
Ol  '(  ■  I  bio,  ks  dehmed  b\  the  User,  Data  is  entered  via  an  input  strobe  at  a  20-Mll/  rate.  Output 
d.il.i  Is  pieseiiled  on  ,i  In  slate  bus  being  02()  bits  parallel.  Ouliiiit  timing  is  compatible  with  the 
input  .illowing  deike  .liammg  All  Inputs  and  outfnits  are  III-  and  CMOS-compatible  with  a 
l.inoNl  ol  ihiee  A  m.islei  ehxk  controls  all  data  seijiiencing  and  timing,  hhe  I’li’l  1  SIC  will  be 
iis.'i-pi  ol'i  .iiiim.ible 


(bit  serial) 


PARALLEL  INPUT  WORDS 


Figure  3 .  Block  Diagram  of  the 
ROM- Accumulator  Algorithm  for 
Implementing  Sum  of  Products  Operator 


T1k‘  simplified  block  diagram  of  the 
FIPH  LSIC  architecture  is  shown  in  Figure  5. 
As  shown  in  Figure  5.  the  input  data  S.  T,  and 
P  arc  parallel  words  that  can  he  loaded  into 
the  input  latches  serially  or  in  parallel.  The 
particular  mode  of  operation  is  controlled  by 
the  user  through  the  use  of  one  of  the  eontrt)l 
input  lines.  This  allows  the  PIPF  LSIC  to 
operate  on  b  X  1  blocks  or  3X3  blocks  of 


Figure  4.  Block  Diagram  of  ROM-.4ccumulator 
Algorithm  With  Parallel-to-Serial  Shift  Register, 
Buffer  Output,  and  Controller 


data.  In  the  serial  mode,  all  data  is  loaded 
through  the  P  input  pins  and  latch,  and  then 

is  seciuentialK  clocked  through  the  other  latches.  In  the  parallel  moile,  the  data  is  loaded  through 
the  S.  7,  and  P  input  pins  into  three  separate  input  latches.  Fhe  injnit  tlata  is  then  setiueiUially 
clocked  into  the  other  latches.  In  the  serial  mode,  nine  sample  perioils  are  required  to  load  all  of  the 
input  latches;  in  the  parallel  mode,  three  sample  periods  are  retiuired.  ’Fliis  method  of  data  enli\ 
eliminates  the  need  for  latch  address  pins  and  facilitates  convolution  and  sliding-window  operations 
with  a  single  part. 


The  bit-panillel  words  in  the  input  latches  are  converted  into  bit-serial  words  by  the  parallel- 
to-serial  registers.  The  outputs  of  the  parallel-to-serial  registers  form  the  b-bit  memory  arldress.  The 
memor\  outputs  are  shifted  and  accumulateil  to  complete  'he  sum  of  products  operation.  I'ri-state 
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Figure  5.  Simplified  Block  Diagram  of  the  PIPE  LSIC  Architecture 


output  latches  are  provided  lor  olY-chip  bulTering  All  timing  and  control  pulses  for  the  parallel- 
toserial  registers,  the  memory,  the  shift-and-aeeumulate,  and  the  output  buffers  are  generated 
on-chip  using  a  simple  shift  register  controller  with  a  loud  pulse  and  master  clock  as  inputs.  The 
shift-aiui-ac 'umulate  circuitry  will  operate  with  either  unsigned  magnitude  data  or  2‘s  complemeni 
data  as  selected  by  the  user.  The  design  is  very  user-oriented,  both  from  the  memory  technolog\ 
considerations  as  well  as  the  number  of  eontrol  lines  retiuired  to  operate  the  chip.  Table  1  defines 
the  control  inputs  of  Figure  .s. 

The  PIPI:  LSIC  design  and  layout  have  been  completed.  Figure  6  is  the  diagram  of  the  IC 
barmap  which  is  representative  of  the  actual  layout.  Figure  7  is  the  CALCOMP  plot  of  the  PIPE 
LSIC.  The  major  sections  of  this  layout  are: 

•  Input  latches  parallel-to-serial  shift  register 

•  I  PROM 

•  Shift-and-accu  inula  te 

•  Tri-state  output  latch 

•  Controller. 

fhe  area  and  cstimateil  power  lor  each  ot  these  sections  is  given  in  Table  2.  The  area  for  each  lunc- 
tion  m  this  table  does  not  include  the  area  used  for  lead  routing.  The  total  bar  size  is  appro.ximately 
24f)  X  270  mils’  with  a  total  estimated  power  of  (>00  mW.  Fach  of  the  functions  listed  above  is 


TABLE  1.  USER  DEFINED  CONTROLS 


Contrul  Line(s) 

Wi)r(.i  length  ( I-bil  BCD  code) 
Parallel  serial 

Master  clock 

Load 

Input  strobe 
d's  cotnplemettt 

tnable 

V 

''dd 

V 

PP 

Data  valid 

Two's  coinpleinent  coefficients 


Function 

Defines  the  word  length  in  bits  of  the  input  data 

Determines  inode  of  chip  operation:  3  X  or 
X  1  operations, 

A  square-wave  clock  provided  for  system  timing 
(<20  MHz). 

Initiates  tire  parallel-to-serial  data  conversion 
«20  MHz). 


Indicates  valid  input  data  and  latches  it  in  the 
input  latches  (<20  MHz). 

Defines  signed  oi  unsigned  magnitude  data 
operation. 

Used  to  tri-state  or  enable  the  output  bus. 

Single  -1-5-V  operating  supply 

Normally  at  5  V  but  taken  to  25  V  for  EPROM 
programming. 

An  output  signal  indicating  a  complete  computation. 

Used  to  set  the  sign  bits  of  die  output  word  when 
<8-bit  input  data  is  used. 
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Figure  6.  Diagram  of  the  PIPE  LSIC  Baimap 
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Figure  7  CALCOMP  Plot  of  PIPE  LSIC 
TABLE  2.  AREA  AND  ESTIMATED  POWER  OF  THE  PIPE  LSIC 


Function 

Area 
(mil  ) 

Power 

(mW) 

Input  luidi  I’  to-S  shift  register 

X.dOO 

17? 

EPROM  nieiiiory 

12,0(10 

1?0 

Shill-;m  rl-acciiniulale 

lo.xoo 

0? 

Tri  slale  output  latch 

2, (.00 

70 

( Diiliol  and  liming 

2.400 

1  10 

1 


dl^(.u^sod  ill  dct.ul  in  Snl'sci.  lions  II  A.d  lliionj!li  II  A  o.  ivs(n.'>.  tivolv  A  I’.n.dk'l  i-tlin  I  ol  i  li.ndw.iu 
dcinoiislialion  ol  Ilic  sum  ol  prodia  ls  uiu'ralor  iisni};  llu’  1<(.)M  .k aniiiiil.i In  aliioi illiiii  o  dis.  us'-ntl 
in  Snlisonlion  11. A  ’ 

2.  Input  Latnit  Parallel- to- Serial  Shill  Rejii.sler 

This  suhsoctioii  tiiscnsses  tlin  input  lalnli  parallcl-to  snnal  shill  rnpisler  Iniu  lion  ol  Ihn  PIPl 
l.SK  I  he  block  dKigram  of  this  inpul  sinicluie  is  shown  in  f  ipnrc  H.  I  he  mam  pails  ol  ihis 
structure  arc: 

•  Input  ninlliplcxcr  and  latch 

•  Parallcl-to-scrial  shift  register 

•  Memory  address  drivers 

•  Memory  address  select. 

funclioiia!  block  diagram  of  a  single  ni|nit  stage  (I  of  dj  is  shown  in  tignre  d,  '[  raiisisiors  (\l! 
through  Mltij  form  the  input  multiplexer;  latches  LO  through  1.15  I  lorm  the  iiiinit  latch;  regisieis 
Reg  I  through  Reg  14  form  the  [larallcl-to-serial  shift  register;  Reg  0  is  the  memor\  ilriver  icgisler 
with  increased  drive  c.i|iabilities;  and  Ml  7  is  the  aildress  select  multiplex  transistor  used  in  pio- 
gramming  the  Id’ROM.  1  he  input  multiplexer  switch  transistors  iMI  through  Mho  select  a  data 
path  for  serial  or  parallel  data  input  operation,  fliis  selection  is  made  possible  bv  tlie  ixirallel  or 
serial  (I’AR'SbRl  select  line  (IIL  level)  which  is  bulferetl  to  an  MOS  level  b\  nnerterll  as  shown 
ill  b'igure  d.  Inverter  12  piovides  the  complement  of  the  PAR  Sl.R  line  tor  selecting  the  multiplexer 


PAR  SER 
SELECT 


PARALLE  L 
LOAD 


ADDRESS 

SELECT 


P  INPUTS  T  INPUTS  S  INPUTS 


tNPUT 

LATCHES 

INPUT 

LATCHES  AND 

MULTIPLEXER 

INPUT 

LATCHES  AND 

MUL'f' IP  LEXER 

PARALLE  L-TO- 

PARALLE  L-TO- 

PARA  LLE  L-TO- 

SERIAL  SHIFT 

SERIAL  SHIFT 

SERIAL  SHIFT 

REGISTERS 

REGISTERS 

REGISTERS 

MEMORY 

MEMORY 

MEMORY 

ADDRESS 

ADDRESS 

ADDRESS 

DRIVERS 

DRIVERS 

DRIVERS 

W  1  i 

Aq  Aq  a,  a,  A2  A2 

tttttt 

^3  ^3  ^4  ^4  ^5  ^5 

1  ^  1  i  1 

^6  ^6  ^7  ^7  ‘*‘8  ^8 

INPUT 

STROBE 


0  CLOCK 


0  CLOCK 


Figure  S  Bhiek  Diagram  of  the  Input  Slruetiire 
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im  of  the  Input  Multiplexer  Latch  Parallel-to-Serial  Register 


switch  transistors  (M  1  tliroLi{;li  M  l(i|.  Once  a  data  patli  has  been  selected,  the  data  proceeds  tlirough 
latches,  LO  tlirough  L7  aiul  awaits  entry  to  latches  L8  through  L15.  The  data  is  latched  into  these 
registers  when  a  strobe  pulse  is  given.  This  latch  takes  place  on  the  rising  edge  ol'  the  strobe 
(STROBb)  pulse.  The  data  is  held  in  these  latclies  regardless  ol  changes  appearing  on  the  input  data 
line  since  latches  LO  through  L7  are  disabled  on  the  rising  edge  of  the  strobe  pulse.  The  strobe 
pulse  (TTL  level)  is  buft'ered  by  inverter  1.)  and  complemented  by  inverter  14  to  provitle  the  projier 
levels.  After  the  data  is  latched  (R)  through  LIS),  it  is  then  ready  for  the  parallel-to-serial  The 
parallel-to-serial  conversion  takes  place  in  registers  Rl  through  RI4.  On  each  |)hase  shift  (‘hSi 
clock,  data  is  transferred  upward  toward  the  memory  address  driver.  Figure  10  shows  a  timing 
diagram  of  this  tunction. 

.A  detailed  discussion  of  the  input  multiplexer  and  latch,  parallel-to-serial  shift  register,  mem¬ 
ory  address  drivers,  and  memory  address  select  circuits  follow.  The  last  topic  discussed  in  this 
subsection  is  the  level  converter  and  input  protection  circuit. 

a.  Input  Multiplexer  and  Latch 

Figure  II  shows  a  single-input  multiplexer.  Transistors  Ml  and  M2  provide  two  different 
data  paths  (A  or  B).  The  SELECT  line  is  presented  to  the  gate  of  M  I  and  its  complement  to  M2. 
A  high  logic  level  on  the  SELECT  line  will  turn  transistoi  Ml  on  and  allow  data  on  the  A  line  to 
propagate  to  point  C.  Similarly,  taking  the  SELECT  line  low  will  produce  a  logic  high  on  the  gate  of 
M2;  then,  data  on  line  B  is  allowed  to  propagate  to  point  C. 
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Figure  10  Timing  Diagram  of  the  Input.  Load,  and  Shift  Se(|uences 


The  input  lutcli  circuitry  is  implc- 
iiicntcd  by  using  twoN-channcl  MOS  (NMOS) 
latcli  strticturcs.  Since  tlic  NMOS  latch 
structure  is  the  basic  building  block  that  is 
used  in  the  input  latch  circuitry  as  well  as  in 
other  portions  of  the  PIPL  LSIC.  a  brief 
discussion  of  this  circuit  is  given. 

The  block  diagram  and  schematic  of  an 
NMOS  latch  are  shown  in  F’igure  12.  This 
latch  IS  composed  of  an  input  (.lata  path, 
transistor  .Ml.  followed  b\  two  NMOS  invert¬ 
ers  (M2.  M3)  and  (M4.  M5).  and  a  feedback 
p.ith  through  transistor  M()  These  two 
in\erters  are  constructed  with  depletion  toads 
|M2.  M4)  and  enh.mcement  (.Irivers  (M3.  M5). 


Mt 


Figure  1 1 .  Circuit  Diagram  of  the 
Input  Multiplexer 


I'he  operation  of  the  latch  circuit  is  as  I'ollows.  Data  enters  the  latch  through  M  1  and  appears 
at  node  1  when  the  L A K  1 1  line  is  high  (logic  I ).  If  the  data  entering  is  high,  then  M3  will  conduct, 
pulling  node  2  low  (logic  0).  Since  the  gate  of  M5  is  connected  to  node  2.  M.3  will  turn  off  transistor 
and  let  transistor  M4  pull  node  3  high,  which  is  the  same  information  at  the  input.  The  LATCH  line 
can  now  be  pulled  low.  causing  the  LATCH  line  to  go  high.  This  high  level  on  the  gate  ofM(5  will 
cause  Mb  to  conduct,  providing  a  feedback  path  from  node  3  to  node  1.  Since  Ml  is  now  in  a 
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nonconducting  state,  data  clianging  on  tlie  input  DATA  line  will  have  no  elTect  on  the  latched 
data.  It  the  data  initially  entering  the  latch  is  low.  a  low  level  will  he  latched  similarly .  A  timing  dia¬ 
gram  ol'  possible  inputs  and  latching  pulses  is  also  shown  in  I’igure  1  2. 

The  input  latch  circuitry  incorporates  a  D-type  latch  action,  as  shown  m  I'igure  13.  I  his 
edge-triggered  circuit  provides  greater  noise  immunity  and  shorter  data-\alid  lengths  than  a  level 
trigger  latch.  The  D-type  latch  function  is  implemented  by  using  two  NMOS  latches,  as  discussed 
above,  clocked  on  opposite  phases  of  the  strobe  (STROBE)  pulse.  The  STROBE  pulse  is  provided 
by  the  user  while  STROBE  is  generated  on  the  LSIC. 

Figure  14  is  a  partial  block  diagram  of  the  PIPE  input  structure.  The  input  data  can  enter  this 
structure  either  througli  the  S  input  lines  or  from  a  previous  stage  determined  by  the  parallel  or 
serial  (PAR/SER)  control  line.  Data  enters  latch  DL2  when  the  strobe  line  is  pulsed.  On  each  succes¬ 
sive  strobe  pidse.  data  shifts  to  the  next  latch.  For  examine,  consider  that  the  strobe  line  is  pulsed 
high  and  data  enters  from  the  S  inputs  to  latch  DL2.  On  the  next  strobe  pulse,  the  data  now'  in  DL2 
will  shift  to  DEI  and  the  new  data  will  enter  DL2.  On  the  next  strobe  pidse.  data  in  DEI  enters 
DEL),  data  in  DE2  enters  DE  I .  and  the  new  data  enters  DE2.  Once  all  of  the  input  latches  have  been 
loaded,  the  |iarallel-to-serial  conversion  can  be  initiated. 

h.  Parallel-t(hSerial  Shift  Registers 

partial  block  diagram  of  the  parallel-to-serial  shift  registers  is  shown  in  Figure  1.3.  Parallel 
input  data  enters  through  a  control  multiplexer  when  the  parallel  load  (PARAEEEE  EO.AD)  line  is 
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Figure  l.t.  Schematic  Diagram  of  the  Input  D-laitch 
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r'igure  14.  Partial  Block  Diagram  of  the  Input  Stage  Architecture  (3  Stages  of  9) 
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l  igurc  1  Partial  Block  Diagram  of  the  Parallel-to- Serial  Conversion 
14  Bits  of  One  S-Bit  Word) 


pulsod.  jikI  enters  tlic  dual  register  (two  NMOS  latelies).  Oiiee  ilie  I’AK  \1  111  I  (t\l)  line  leiL.i!!-. 
to  a  lo\v  logic  state,  the  phase  shilt  (>hS)  cluck  will  shilt  d.it.i  hits  Uuiii  the  piewucs  iccislci  lu  lii 
next  register.  I'he  phase  shit't  clock  is  generateil  on-chip  hy  (JR-ing  the  parallel  load.  <!’  clo.k  .iitd 
the  \'p  voltage  level.  This  operation  is  synchronous  with,  tlie  twivphase  nonov erlai'pmg  .  hnks  cl’ 
and  'I'l  1  hese  clock  pulses  are  generated  on-chip  Irnm  the  inaslei  elock  Mipphed  In  llu  icer 

<■  Memory  Address  Drivers 

1  he  memory  adilress  driver  is  the  last  stage  in  the  p.irallel-lo-seriai  shilt  register.  I  hc  ciKiiit 
diagram  ol'  the  memory  address  driv  n' is  shown  in  Figure  lo.  rhisciiLiiil  receives  e.ich  shifteil  inpiii 
data  hit  and  latches  this  data  for  one  clock  cycle.  The  latched  data  is  hnlYered  to  drive  the  memoiv 
address  decoders. 


d.  I’rofirammin}’  A  ddress  Select 

1  he  Vpp  line  is  taken  to  volts  during  the  iviogrammmg  of  ihe  l.l'KOM.  .X  special  \ Tp  volt 
.ige  divider  iFiglire  I  enables  the  progiamming  address  seleci  multiplexer  so  that  Ihe  inpul  dala 
lines  c.m  he  used  as  niemc.)rv  address  inputs  (Figure  IX).  The  Vpp  voll.ige  divider  circuit  also  pio 
vides  the  program  enable  (I’l  1  for  the  I  FROM.  I'his  will  make  the  meniorv  easv  to  program  and  no 
sfiecial  input  hit  shilting  is  required.  In  normal  oper.ition,  the  \'pp  line  is  kepi  al  a  .s-volt  level. 

e  Level  Converter  and  Input  I’roterfion 

The  I’lPF  IC  operates  on  a  single  +.s-volt  siq'ply  (\  |,|)  i.  and  all  minits  and  outputs  are(  .MOS 
1  d  L-compatihle.  Most  of  the  on-chip  circuitry  rci|uires  a  0-  to  5-volt  level  for  logic  lows  and  higic. 
respectivelv .  l  lierefore,  a  level  inverter  is  necessarv  to  convert  TI  L  levels  to  the  MOS  levels  needed 
.A  TTL-to-MOS  input  buffer  is  shown  in  Figure  l‘>.  It  consists  of  two  inverters  m  senes  Also,  input 
protection  from  static  damage  is  provided  on  the  input  M  I  aiul  R  1 .  Fins  level  but  ler  is  used  on  the 
.ippropriate  ininits 


3. 


I  PROM 


Si.'\er.il  tor  ROM  lcv.'liiioP 

ul'Ic  '.'oiinkIcicvI  as  saiuliria.lcs  Urr 
mipkiiK'iiliiiLr  the  iiiemorv  liiiKtion  ol  the 
Rt  iinuilats'  alaoi  itliiii.  Prmte  eoiisiiler- 

atunis  in  the  soloelion  ol  ilia  opliiiiimi  ROM 
Is\  hiiolor’i  loi  the  IMI’I  I  SU  are: 

•  L  ser-proei amntahle 

•  1  ase  ol  repi oarainniiiia 

•  Militai  i  eio  iroiiment  eonstrainis 

•  Siiiele  suppK  \o|iaLH'  operation 

•  Stalls  on-slup  loeie 

•  I  siahhsheil  K  v  hnoloe'. 

•  NoniolatiL 


Vpp  (5  TO  25  VOLTS) 


Vp  (0  TO  5  volts) 

TO  ADDRESS  SE  LECT 
The  eleetrieali\  erasable,  proirrainnial'le  RO.M  I - J  ^  -p.p.  pvpo  ami-i 

tl  PROM,  N-ehan,:e(  MOS  iN.MOSt  Kvh-  ^  PROGRAM  ENABLE  )PE) 

iiologe  vsas  jinhaeil  to  be  the  I'est  lecliiK)l-  I —  I 

o;j\  to  meet  the  reqinienieiiis  ol  the  proirrain. 

1  he  asKailtaee  o;  an  I  I’ROM  les  luiolui;\  is 

that  It  Is  Liser-pi oei  aiiiniei!  aiiel  not  mask-  Figure  1  ?,  VOItajic  Divider  Ciriniit 

proeraiiinieil  Anoihei  iinpori.int  .iiKantaee  ol' 
the  l.l’ROM  Is  that  iIk  piom.im  eaii  be 

eles  triealK  erased  .A  tiaitsj'.neni  <|nari/  wiiulow  covers  the  l.l’ROM  paekaite,  !  rasiiij;  the  1  PROM  is 
a  simple  mallei  ol  e\i''.sim-  the  wnulovv  to  ultraviolet  liirlit  Altei  eiasiiiL'.  the  I  PROM  eaii  be 
proL'i'amiiied  apaiii 


Figure  1?  V  \ oltajic  Divider  (  ireiiit 


I  he  I  PR(),M  Ics  hiioiofjv  .illovvs  !or  a  summon  ship  in  iIk-  inventorv  lobe  peisonah/evl  loi  a 
partieiilal  aleonliim  but  it  s.m  aKo  be  e.isiK  leusesl  later  |oi  an  s'lilirelv  shtlereiil  ai.eoiiilim 


Eiiliire  IS  I’roCTamiiiiiii;  Miiltiplevei 


Aiiullioi  illipui  Liiil  ^  >)ilsnUialiii]i  1-  iImI  I  1'I\(»M  ai.  M.ia.iii  AliiJi  ii|n'ij|',  <  r.  l  i 

tl;  I'lll  nulllaiA  iMililv  111  /A  (  ti)  'IJA  <  .  I  a-.i  r- iuk' .  I--  ..a-  ;:i  |  1  -  c  1 1 . .  I  H  a .  iiiilit.il^ 

rK()\I  taiiuK  r.iiipm;.'  Iiina  ■  K  (Ik  --  !i)2-i  I'lK-.iI-i  vlK  i  !  S\1  I  i;,;.;  iv  ja  ,Nk  p.iil 

■  i| .1-  a  IK  ■  N  al.r. ,'!■!•■  aiiii  K-|>i  n.'i  an  i  a  ..i  ('1 .  i;  11.;,  Il'I'.ilM  I .  .  m  i '  >1  u; a.  aN'i 

:n'ii  'il.iilk.  aIiu'i  luis  Itic  a>l\ an  la.Ki.’  tilal  Uia  ■  wl  i.i.a-  ■.  .a;  1  a  ! in- a.  a  i.  ilia  A.i'.  |  i,:  i!,a 
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a  ■ '  I  am  I  iia  I  'la  niamora  .  (In  is  nu  la  aMin  I  lia  ii  sal  .  ■:  w; ' n  .  ;  i  I'l  |'l 

Ilk'  -mipliin'd  I'i.'sk  aiiaala.i;  lla  il’Ki)'v|  '  ■ 
an^lla^s  iiipills  ara  dlnainaal  iiuin  Ik,  n:l  saiial  in;  i  .  i 
'  Pa'  sli il  t-jiul-aa  a  ninuiala  >  Iran; I  111  '  1. Pal  i  ■  la 

i-'iiaK  usad  in  lla-  niaiinir.  na  daii'vad  h'.  tli.'  ai):iin'l|,i 
.  iiini;  aaii  I'a  pm  i;jrjninia'l  ii\  liMiliin  ill'.'  asaa  a  n'l.i 

n  ull  ( spi'  Sn'nsaaliiin  II  \  d '  alula  iIk-  i,ia':.:"i'  .  'i 

"1  ipiil  linas. 
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DATA  OUTPUTS  (PROGRAM  INPUTS) 
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8  STORAGE  DEVICES 


A.  BIT  PLANE  CELL 


FLOATING  GATE  GATF 


B,  CROSS-SECTION  OF  EPROM 


TRANSISTOR  DRAIN 


GATE 


FLOATING 

GATE 


SOURCE 


By  siip|i|\  ing  ihi.'  (.'i)rii.'i.'l  .uiLiii'sM.s  lA,, 
ihrouL'h  A,s  I.  llic  X-  aiui  'i  soks  i  the 

proper  12  nteniory  eioraee  Jer  iees  where  lothe 
hiehs  or  loizie  lou^  ha\e  been  previoiisK 
stored.  These  12  loeu  levels  (hitsi  are  then 
latehed  into  the  inemorv  ilata  lateh.  wIikIi 
leeds  the  sill'  i-and-aev'iiiinilale  i  iie  nilr\  , 

1  he  .\  and  ^  ileo  uie  uieiiitl  \  is  shown 
m  I  lenre  2.'  The  input  address  and  its  d.iia 
'.'oinplenienl  aie  haid  wned  lo  ihe  devOvle 
driver  iiansisiois  iMi)  ihu'iieh  NTs  i.  piovidine 
tile  neeessaiv  H(  I)  lo  (k  ^  iill.d  de.  ode  I  hi'  six 
drivers  lor  ilk'  \  de.  .ule  will  piovkie  2'' 
dilleieiU  eoin  hina  I  ion  .  lesiilline  in  ('4 
possible  seleUloiis  .\I  Ilie  s.iine  IniK',  tlk'  't 
lies  ode  piovides  '  eom  bi  nat  I  oils,  lesultine  in 
eis’ht  possdde  seke  lioils,  thus.  In  iisiiie  .i  b  lijt 
adihess,  II  is  (Kissihk'  lo  sek\  I  2'  oi  's  I  2 
si  oi  I I'l.  h  u  a  1  It  in s 


C  MEMORY  STORAGE  DEVICE 


(hue  the  .nkliess  is  siipplu'tl  .nu!  the 
I'loper  stoiai''.-  lo.  alion  has  been  seleelevl.  it  Is 


l  iiliire  22  I'l’KOM  (  ell  Strut  lure 


ill  'll  lo  ik'IcmiiiK'  IIil-  Ihiik 

^Ul;  1.1.  I'ciiiLi  rillici  .1  luiik  itiiili  urn  louu'  low. 
llll^  luiK  tiiiii  Is  I’ci  loniK'il  In  iIk‘  simisl 
.iiupliiu'i  shov.ii  ill  l  iiimn-  24. 

\\  lu  ll  Ihc  I'luiV'i  solcsliini  lins  lii'i'ii 
III. Ilk'  I'lv  llir  ik'iuik'  illi'lllIIA.  lllc  st'llsC 
.iiiil'IiiK'i  ik'U  I  iiiiiR's  till'  lo.uii-'  sl.iK’  stored 
111.  ii'  in  ineiis'.ir’ii'j  ilie  eli.ii'ee  sloivil  on  the 
iksiliiie  LI. Ill'  ill  iIk  sIokilv  Ir.iiisislor  ( )iu'e 
II  n  k'li'l  is  di'li'i  k'll.  il  is  InilTi'li'd  to  drive 
'ii,  oiilpiil  nieiiioir  lateli  III. it  leeds  tlie 
1 1 1 1  .1'  iii-.'i  ..'iiiiiul.iie  i'll  eililiA  . 

Hel'iie  proeianiminLi.  the  niemoiA  is 
I'l.is.  u  i'\  iNposiim  the  ihi)'  tliroii.eh  the 

ll'.illspaii  III  '.viiuiou  to  lliL;h  deil.sil;,  iilll';i- 
1.  inlet  Ipelii  (n.ivek  nelh  2..sa7  aiigsironis). 
the  i'ei'oinuieiidei.1  miniiuum  exposure  dose  is 
|s  '.ival  l-sei  iiiuls  per  si|uare  eeiitimeter.  -M  ler 
eiasuie  (.ill  Inis  are  iii  loeie  liii'h  statel.  loeie 
ions  ale  pi ojii ainnied  iiilo  the  desireel  loca 
lions  \  Ion  leu'l  e.in  onl\  be  er.ised  h\ 
ultr.iv  lolet  liitlit.  rile  proeramminu  mode  is 
ii'liiewd  nlieii  \  |,|  is  l.iken  to  2.^  rolls.  Data 
Is  presented  in  parallel  12  bit  words  on  <di 
tliriiiit;li  (.),  ■  oi'l'  iL’ure  21.  I  he  eorrespondiiip' 
Ion  bits  .ire  proiriainin  'd  iiiio  the  memoir  I'y 
l.ikme  llie  lines  lorr  tor  .si)  ms 

4,  .Sliilt-aiul-.'\ei  ninniatc 

I  i,_uie  2s  Is  .1  I'lloe'k  (.liagram  ol  the 
ill .!  i  .md-ai  eunuilate  eireiiitir  wliieb  e0ii- 
■•linil'  .1  s,ii,i-ol-ptoduels  Irom  a  b■ll4'|y4tit 
p.iiMlIel  s.'iial  arillimetie  opei.ition. 

riiele  are  !  2  data  mpuis  that  come 
koi'i  the  I  I’lU.bM  sei  non.  S  eonirol  lines,  ami 
.’o  oiiip'it  lines  Sinee  the  l-sOM-.ieeiinuilate 
1  .  linniiie  pnodiiees  no  roinid-oll  error,  .ill  20 
lilts  .lie  savi.'il  .ind  presented  .is  outputs 

limine  2(i  shores  the  oper.ilion.il 
siiiiitiiie  ol  the  slid  l-and-.ieeumulale 

'.ill  II I  try 

I  rrelve  bit  memoir  ilala  is  piesenled  to 
:  I  ii  1 1 1 II  .idder  "A  "  in  pi  i  Is  on  the  i  ismp  edtre  ol 
'I  1  pli.isi  eloils.  ami  l.iti  bed  on  the  lalliiii; 


A.  X-DECODF 
(  I  OK  t4l 


B.  Y-DECODE 

(i  ore) 


I- inure  2-V  Scliematie  Di.apraiii  of  the  X  and  Y 
Matrix  Decoders 


ind  Buffer  Circuitry 


TCC  MEMORY  DATA 


OUTPUT  DATA 

Figure  2S.  Block  Diagram  of  die  Shift-and- Accumulate  Section 


edge  of  <1>.  The  suni-and-carry  results  are  latched  into  a  master-slave  register  on  the  next  rising  edge 
of  >I>.  By  feeding  the  sum-and-carry  latch  outputs  forward  (left)  to  remaining  full  adder  inputs,  a 
binary  multiplication  is  performed  per  each  clock  cycle. 

The  sum-and-carry  latches  are  cleared  on  the  rising  edge  ol  the  clear  accumulator  (C'LI  AK 
ACC  )  pulse  which  initiates  the  shift-and-accumulate  operation  fora  given  input  data  block.  .Sign 
magnitude  or  2's  complement  memory  data  can  be  used  by  activating  the  true  (TRCli)  pulse  or 
complement  (COMP)  pulse  lull  adder  control  lines.  respectivel>',  Individual  sums  and  carries  are 
latched  into  the  outptit  adder  on  the  falling  edge  of  the  latch  accumulate  (L.ATCH  ACC)  pulse,  and 
the  final  result  is  valid  on  the  next  rising  edge  of  the  LATCH  ACC  pulse. 

The  total  computation  ret|uires  2B  clock  cycles  per  input  data  block,  where  B  is  equal  to  the 
number  of  bits/in|nit  word. 

The  logic  implementation  and  truth  table  for  the  PIPL  LSIC  full  adder  function  are  shown  in 
Figure  27.  Note  that  the  outputs  of  this  circuit  are  complemented  sum  (QS)  and  carry  (QC).  allow¬ 
ing  a  minimum  number  of  components.  ’Fhe  full  adder  (FHL.AD)  schematic  diagram  is  shown  in 
Figure  28.  1  he  addend  input  includes  a  latch  from  which  tlie  true  or  complemented  data  may  be 
selected.  Transistors  Ml.  M2  through  M.S.  M().  and  M7  atul  .MS  are  a  pass  gate,  a  two-stage  inverter, 
a  feedback  gate,  and  select  gates,  respectively.  I'ransistor  M  14  is  the  load  device  for  the  carrv  NOR. 
with  the  series  string  ot  transistors.  Mb  aiul  MIO.  ANDing  the  addend  and  augend  inputs,  the 
[laralleled  rlevices.  MI2  through  Ml.f.  ORing  the  adileiul  anrl  augcm.1.  ami  the  series  device.  Mil. 
ANDing  the  OR  result  with  the  previous  stage  carry.  Comprising  the  sum  NOR  are  the  following: 
the  load  device,  M22.  the  paralleled  transistors.  ML'S  through  M  I  7.  that  OR  the  three  inputs;  the 
series  transistor.  Ml  8.  that  ANDs  the  OR  result  with  the  ('ARRV  output;  and  the  series  string  of 
transistors.  Mlb  through  M2I.  that  ANDs  the  three  inputs.  This  circuit  rerjuires  only  true  level 
inputs  and  a  total  ot  It)  devices,  excluiiing  the  latch. 

A  rCC  (2's  complement  coefficient)  control  line  is  provideil  by  the  user.  When  operating 
with  signed  niagnittide  input  data  less  than  X  bits  in  word  length,  the  TCC  line  can  he  taken  to  a 
iligital  low  to  ensure  that  the  upper  sign  bits  are  set  correctiv  . 


COMP 

TRUE 

‘  B)  C  QS  (A  t  B  '  C)  QC  ♦  ABC 


iguri-  2S  Sclionialii'  Diagram  of  iIk-  Full  Adder  \Nith  Inpiil  Data  Uitch 


1 


LSUM 


LCAR 


Figure  2‘).  Schematic  Diagram  of  the  (  arrv  Sum  Latch 


I'lie  ^Uln•alKl-eu|■I■\  hitch  sliown  m  l  igiiiv  i>  an  inverting  dii.il  mu'ter-cl.n e  conligiir.itinn 
wliieh  aeeepls  data  dm  the  tailing  eilge  ol  'l’.  prevents  eoinpleinented  data  at  the  output  on  the  ne\t 
rising  edge  ol  >l>.  and  lalehes  the  output  on  the  suirseiiuent  tailing  edge  ol  ‘I>.  I'asc  transistors  \1  I  and 
VI  d  and  leedl'.s  Is  transistors  M2d  and  M24  are  enahleil  In  while  pass  de\ises  M  I  s  and  M  14  and 
leedhaek  devices  \1‘)  and  MIO  are  enaliled  by  '!>.  Iransistors  M2  through  MS  and  Ml  I  and  M12 
imt'leiiient  the  dual  master  two-stage  inverters,  aiul  transistors  M  1  ,S  and  Mlti,  Ml'*  through  M2  2. 
■iiid  M2s  and  Vl2('  implement  the  dual  slave  two-stage  inverters.  Devises  M  I  ami  M  IS  aie  p.iialh  led 
with  M  I  .s  aiul  MWu  respectively,  in  the  dual  first-stage  slave  inverters  to  provide  .i  ele.ii  c  .ipabilitv 

d  he  output  ailder  eireuitry  uses  the  charge  storage  capabilities  of  NM(JS  devices  to  implement 
a  compact,  high-speevl.  look-aheail  carry  function.  Figure  .^U  shows  a  4-bit  2's  eompleiiK  iit  .uhh  i 
I  he  e.vciusive  OR  (  XOR)  circuitry  is  static  NMOS  logic 

file  NOR  gates  showm  are  a  subset  of  the  XOR  gates  and.  therefore.  rei|uire  noevtr.i  devices 
to  realize  a  carry  decode.  'I'lie  fast  carry  is  done  with  dynamic  ratioless  eireuitrv  lM5  through  M  1  .s  i. 
where  capacitive  nodes  n,,  through  n,,  can  temporarily  store  clocked  signals  .V  I  AK'II  .\f  ( 
pulse  preeharges  nodes  n,,  through  n,  to  a  logic  high  voltage  through  transistors  M.s  through  Mb 
When  FAFC'II  AfC  pulse  goes  low.  noiles  n„  through  n,  are  selectivelv  discharged,  based  on  the 
output  voltages  of  the  input  NOR  and  XOR  gates  existing  on  mules  n.,  through  iin  .  When  nodes  ii,, 
through  n,  have  settled,  the  complemented  sum.  S.  ot  A  and  If  appears  at  the  outputs  ol  the  tm.il 
XOR  gates,  file  dv  namic  ratioless  carry  allows  the  adder  to  operate  at  high  speeds  with  a  nummiim 
number  of  tr.iusistors  and  low  power  dissipation 
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MI  X;  aiul  will  piiKliko  a  l.AK  II  Ol'll’l  l 
\ia  NOR  gala  (iii  the  aaiiK-  '!>  aloak.  Tlia 
(Hitpul  ol  NOR  pale  N^  is  also  hiilTereil 
lluoiipli  hiilTer  Rl  ;  to  (MOMde  an  external 
DATA  \  ALIO  pulse  A  tinnnp  diaprain  ol  this 
liineiion  is  shown  In  I'lunre  x’’.  Lite  lateh 
aeeimuilator  ainl  lateh  or.tpnl  waveronns  are 
shown  lor  I-  through  K-hit  input  veortls. 

The  eontrol  multiplexer  is  shown 
seheinatiealK  in  L’lgure  xS.  I  his  multip'lexer 
i.onsists  of  three  input  'tapes  (TIT.  level) 
wliieh  provide  the  neeessarv  eontrol  logie  to 
dri'.e  the  deeotle  eireiiiiiA.  The  deeode 
eireuitrv  seleets  the  proper  data  path  hy 
turning  on  the  associated  transistor  (M2(), 
M:?.  M.'U.  M:s5.  MTO.  M45,  MSU.  and  M55). 

A  block  diagram  of  a  l)-controller  latch 
is  shown  in  I'igurc  xd.  Lhis  latch  is  imple¬ 
mented  In  using  two  NMOS  latches  in 
cascade,  clocked  on  opposite  phases  (I'igurc 
,T)Bi  which  produces  a  l)-l\pe  latch  function. 


A)  BLOCK  DIAGRAM 


b)  circuit  schematic 

Figure  .^6.  Controller  Sel/Resel  Latch  Diagram 


•A  circuit  schematic  of  the  clock  buffer 
BTl  IS  shown  in  F'igure  40.  This  buffer 

reeeiees  the  master  clock  (TI  L  level)  off-chip  ami  derives  the  two  phases  0  to  5  volts  necessary  to 
rlrl^e  the  on-chip  circuitiv.  Input  proleetion  is  providetl  bv  M  I  4  and  Rl. 
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LATCH  ACC  AND  LATCH  OUTPUT  WAVEFORMS  ARE  SHOWN 
FOR  SEVERAL  POSSIBLE  8  BIT  WORDS.  ALONG  WITH  SEVERAL 
POSSIBLE  LOAD  WAVEFORMS. 


Figure  M.  Controller  Timing  Diagram 
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A  +-  B  C 

(a)  logic  diagram 


A  +  B  ^  C 
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(b)  circuit  SCHEMATIC 
Figure  41 .  NOR  Gate  Bus  Driver  Diagram 


I  lk'  NOR  gales.  N|  .  N.-.  N  ,.  aiul  N\ ,  aiv  iv()uiivil  to  drive  iniernal  buses  and.  iherelbiv.  not 
onlv  provide  an  NOR  t'unetion  Inil  also  proviile  revjuired  buft’ering.  .A  logic  operation  and  eireuit 
seliem.itie  is  show n  in  bignredl, 

7.  Progranmiable  Suin-or-Prodiicls  Operator 

|o  lullv  explore  tlie  .ircliileetural  implietitions  of  a  progrannnal'le  sinn-or-produets  operator, 
a  hardware  nnt'lenienlalion  of  lk|uation  1  has  been  designed  and  labrieated  under  a  parallel 
eonlraet  with  (  arnegie-Mellon  University.  The  I'readboard  can  oper.ite  on  either  sliding  or  Fixed 
blocks  ot  data.  \  block  vUagrain  ol  the  liardvvare  implementation  is  shown  m  f  igure  42.  The  breavl- 
boaixl  consists  ol’  nine  input  latches,  nii'.e  parallel-in  serial-out  slii'l  registers,  a  last  .s  1  2  X  12-bit 
niemoiv  lor  leinporary  storage  ol'  the  partial  products,  an  I  I’ROM  lor  permanent  storage  of  the 
p.iili.il  products,  sliitl-and-aceunnilate  circuitrx.  tri-state  output  latches,  and  control  circuitrv 

I  he  input  latch  structure  is  hardware-  or  sollxvare-selectable  lor  either  serial  data  entrv  at 
l),\I,\  IM’l’l  l>  or  parallel  data  entrv  at  U.M  A  INPUT  S.  DATA  INPU  I  T.  and  DA  l  A  IMH  I  P 
I  Ins  lacihlatcs  the  imiileinentation  ot  a  nine-point  transversal  filter  or  a  bv  sliding  window 
operator,  respectively.  1  he  in|iul  data  word  length  is  hardware-selectable  I'rom  1  bit  to  b  bits  and  is 
hardware-  or  sol  tvvare-selectable  as  2's  complement  or  sign  magnitude  tormat. 

[  he  We  ighting  coeriicients,  \V,.  determine  a  set  of  partial  piroducts  which  .ue  stored  in  a  512 
/  12-bil  high-speed  r.idoni  access  memory  (R.AMi.  Partial  products  niav  be  dovv  ii-loaderl  on  the 
data  bus  from  an  external  source  bv  a  controlling  processor,  I  he  partial  piroducts  are  hardware- 
selectable  as  2's  coinpleinenl  or  sign  magnitude  formal,  file  irartial  products  obl.nneil  from  the 
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RAM  diiniiL’  o|n.T;ition  ;ii\'  sLiinnK'd  in  ;i  v;iriy-s;ivc  ;K\iiMuil;ilor  with  il:il:i  shilliny  to  wciiilit  the 
M.miitk .ni^c  ()l  (.'.(ell  jviilul  p/oiiiKl.  Onipnl  il.ilj  (20  lijts)  is  lMilTeivi.1  hy  a  tii-st;ite  output  hitch 

1  he  d.it.i  input  section  consists  oh  nine  hitclies  connected  to  tonn  .i  d-s).iue-lonti-h\ -K-hit-\'.  ide 
sliilt  register.  I  lie  outputs  ol  e;icli  st.ige  ;nv  nlso  connected  to  nine  piiniilel-to-sernd  conversion 
registers  wliich  tonn  tlie  vluta  tor  ROM-accninnhite  operation.  Tlie  data  input  operation  is  loii- 
trolled  hv  the  INI’l  !  (ON  I  ROl  1  l.R.  The  controller  has  a  hanlware-  or  sol tvv are-selected  B(  U 
Value  count  ,is  one  oh  its  inputs.  I’he  INI’l"!  RRSli  T  line  is  pulsevi  low  to  iniliali/e  the  controller. 
1  he  INRi  r  S  I  ROHl  line  shihts  data  throueh  the  input  late  hes  ain.1  clocks  the  controller  on  its 
Icavline  edee.  .Al  ter  enoueh  strobe  pulses,  the  INRl.^l  ( ON  I  ROl.l.l  R  eenerales  a  l.O.AIJ  pulse 
which  hitches  the  vhita  iroin  the  inpnl  hitches  into  the  parallel-to-serial  registers  and  starts  the  liiiili- 
s|ieed  asvnchronous  COM  ROL  L  1. R  I  he  LOAD  pulse  also  resets  the  INl’l  l  COM  ROLL  I  R  so 
that  new  tlata  can  he  shitted  into  the  input  latches  while  the  ROM-accuinuhite  operation  is  takine 
phisc  A  second  INRU'l  Rl  SL.l  pulse  is  not  rei|uireih 

Lhe  ROM-accuinulate  CON  [ROLLliR  operates  liaiin  an  asvnchionous  internal  l(i.~-Mll/ 
Oscillator  (('tl-ns  periinl).  hills  is  the  ina.xinuiin  clock  rate  lor  the  conipoiients  selectevl  hor  the  slnii 
registers  ami  accumulator.  .Alter  a  LOAD  pulse  is  leceived  Iroin  the  INF!  1  CON  I  ROl  LI  R.  tin 
(  ONIROLL.I.R  seiiuences  the  operations  oh  the  FAR.ALLl  l.-to-Sl-Rl.AL  Rl  (dSII  RS.  the  FAR 
I  I AL  FROI)L(  I  MlMOR^  ,  the  SI  II I  i  -  ANI)-.A(  (I'M  I  L  A  1 1 .  iunclion.and  the  Ol  LFl  I  1  Alt  II 
It  .ilso  provides  signals  which  can  he  inonitoreel  b\  an  external  processor  il  dcsiieih  .Aitei  tlic  lina! 
INFC  I  S  I  R(  )B1 .  pulse,  the  result  ol  (he  ROM  acciunuhite  caL  uhiiion  is  av ailanlc  at  the  1  R l-S  1  .A  I  I 
Ol  IFl  I  I  \l(  ll  m  I  ♦  ('  j|  internal  clock  c\ cle’s.  Idle  adihtional  (d  ,■  clock  cedes  ]S  due  to  the 
pipelined  ai  shite\'tuie  I  hus. 

'  KOM„.oinu,i.,H-  "  1'’*'  "  ns 

where  ’^*v'  tnne  leipiiled  to  provcss  the  data  aiul  R,  is  iIk  inciii'e:  ot  mcii  il  is  .in  I 

hits  111  each  input  vial  a  w  old  Lor  S  hit  rlat.i,  this  is  S“(i  i,s  lhe  R(  'M  -  .i  s  unr.dil  '  ON  I  R(  >1  I  I  R 

will  hring  the  Rl  AID  line  high  vvIk'ii  IIk-  iHiipiit  is  ,i'.,iilahle  .nul  will  inn  ,dlo\\  aiiothei  us.di 
nv  ei  w  I  itc  I  lie  output  la  tell  data  iiiilil  tlie  Rl  AD  \<  k  in  put  is  piiik  d  hi. !;  h  .  1 1  e  ,  "ii ' ,  oPi;  e  n  i>. ,  ■ 
sol .  1  he  Ol  1  FI  1  HI  I  I  I . R  I  I  L.l  line  goes  hiigh.  tiu  1  NFC  I  S  I  Rt  )HI  line  is  .nhd'i  t ,  >1  , .1  ;  1 ,, 

internal  ossillaloi  is  stoi'ped  il  .in  oveivvnle  eomhlion  ..-xisis  1  Ik-  Hi  t.il.  o  .ip  .is  .n  ,,,ii  .  u  I  , 
pulling  the  c\ tern.il  Ol  1  FI  1  I  N.A HI  I  line  low . 

(twine  to  the  pipeline  oiLMin/ation  ol  the  ROM-.k  s  uiniil.iloi  h.ndw  n.  .  In  l'\K\l  1  I  I 
SI  R 1  .A  1  R I  (  dS  I  I  RS  c.in  he  hiadeil  as  soon  as  H  hits  ol  d.it.i  h.iv  e  been  sn  1 1  K  l  ■  .,1  •  n  in,:  i  . 

H^  internal  cloek  sVsles  ot  oil  •  H.  nanosesonds  .il  lei  lhe  List  INI’I  I  S  I  ROHI  peds,  la  s,  | 

vlata.  this  IS  4M)  iis.  whkh  gives  a  throughput  rate  ot  .d'oul  2  MM/  Il  the  INFI  I  (  I  iN  1  K(  )|  1  |  R 

generales  ,i  LOAD  piiLe  helore  lhe  FAR.M  1  I  1  SI  Rl  Al  Rl  (dSIl  RS  Us  '-mi' I  ;,a  1  Ihs  INFI  1 
HI  1  I  I  R  I  11.1  line  will  go  high  .nul  lhe  INI’l  I  S  1  ROHl  hiu-  will  is  inh  n  i  ,a;  i  u  1 1 ,  i  lie  i  e.  - 1 .  i ' 
arc  einplieil  to  pis'Venl  ov  si  w  i  it  nig  aiiv  d.it.i 

1  mill  ills'  above  slos  iission.  p  s.in  he  seen  llial  ills  iii.ixiinuin  inpi.l  .l.il.;  i.u.  il  p.  n.i-  oi,  p  . 
nninl's-r  o|  hits  m  the  inj'ul  vl.il.i  words  iH,  i  .iml  tin  inunt'  i  ot  IM’I  1  s  1  Ki  ihl  '  i  h, 
niimhs'l  ol  INFI  I  S  1  ROHI  piiHes  (  N  l  Inlweeii  |' n  .hlsl  to  s,  a  i  d  on  .  e' sio",  s  , !  i  :■  p,,  i.;, 

I  ipei  .ilions  pci  lomu'sl  1  Ol  sliiling  ,!  ■  '  sn  't  •  1  lilli'i  .ippli  .il,ons  , .  ,  i  1  . i 

hs'fws’sni  p.ir.ills'l  lo-ss'ii.il  s  oiiversions  I  ,n  noiisL-dnig  a  '  vviudo-s  ,  .j  ,  i  ■  .  p  s  I-  .  ;>  .o 


I  t 


arc  needed  aiul.  I<n  a  *'  ■  |  traiislorm,  nine  slrohe  pulses  aie  needeil.  I  lie  maeinuiin  inpiil  ilala  latc 
n  eiven  I". 

I M  \  \  l\  “  •  '  IK  '  <’•!  ns  I 

I  .'I  si  id  me  u  indiiVK  m  irans\  ersal  I  liter  applkalions  with  X  hii  tiala.  i  m  \  e  -  I  Ml  I  /  I  “i  .ippli,  a 
tiiin  sin  h  as  an  X  '  I  1 1  anslorin  \Mtli  X-  hit  data.  Ki  a  \  |\  “  I  <i  M  1 1  /  . 

I  III  ina\inuini  iuit|nit  dat.i  I'.tte  is  alwriNsgl'icn  he 

i\i  \\  oi  I  =  I  *  IK  '  *'** 

Pic  hread  h'laid  aUii  can  operate  in  pal  .die  I  wi  lli  l\e  o  other  hre.ull'i  mi  iK  to  |n  o\  ide  I  li  roiii’l  i 
pnl  .It  le.il  tune  I  l  \  I  d.d.i  lates.  liiiee  control  lines,  shoun  ilashed  in  1  ir-’iiie  42,  .ire  pioeided  to 
s\  iKhroni/e  tins  opei.dioii  I  hese  lines.  lA  RLSI,  I  OP  11*11  ,  lA  S  I  KOMI  .  and  PI  I  \R  OP  I  I’l  I 
1  N.MtI.l.,  are  normalK  >i  eonnecteil  when  oireratiil'r  a  siiprrle  hieadhoanl 

1  he  proar.iininahle-sum-or-pi'xliiels  hreailhoaial  is  1  2  a  12  ■'  inehes.  weielis  “  pounds,  .nid 
dissiiMte-  I  .A  w  iM  I  iniie  4,'  is  a  photograph  oi  this  hieailhoaial,  i  or  si/e  eoin p.irison,  .i  4()-pin 
rhi  d-indine  p.ok.ige  is  shown  arli.uenl  to  the  eleetronies  hoanls.  With  llie  e\septlon  ol  the  input 
eoniroller,  output  eontroller,  .nul  the  I  FRO.Ms  for  proL'tam  ilownloading,  Ihe  entile  eleetroiik 
hoaids  h  i\e  heen  iniegi'.iied  onto  the  1*11*1  I.SIP.  I  his  represents  nionohlhisalK  integi'.iting 
.ipproMin.ileh  discrete  inlegtaUai  eireiiits 

B.  PIPE  LSIC  APPLICATIONS 

I  111'  suhseetion  is  .i  ilisenssion  of  .ipplie.itioiis  oi  the  PlI’P  l.SIC  des^riheil  in  the  preceding 
see  tion,  I  he  tv  pes  ol  oiiei.itioiis  as  well  .is  operational  eonsliiiints  are  discussed 

I,  Introdiieiion 

M.in\  digit. il  sign.il-proeessing  and  iinage-(iroeessing  .ilgorillnns  rei|uire  operations  ol  the  lorm 


M  I 


1  0 


wiiete  the  \V|  represents  .i  set  ol  lixed  weighting  coefficients  and  the  X,  represents  .i  set  or  seduence 
ol  input  v.dues,  I  i|u,ition  .S  can  he  iiseil  to  c.dciihite  Ihe  coelficienls  ol  \arious  transforms  such  .is 
l  ourier  Posme.  lladam.nd.  Il.i.ir.  etc.  Where  iwo-diineiisiunal  transforms  .ire  neerlei.1.  successive 
one-dnnensional  ir.inslornis  can  be  iiseil  il  Ihe  Ir.inslonns  .ire  sejiar.ihle  Fotr  image-processing  a|iph- 
e.itions,  I  i|u.ition  .x  delmes  the  disirele  convidulion  ol  a  tW’o-dimension;il  input  im.ige  with  a  con- 
solution  ,nta\ .  1  hese  m. itheinalk.il  operations  are  based  on  the  .uliacent  pixel  values  and  .ire  termed 
neighhoi liooil  oper.ilors  I  x.nnfiles  of  neighhorhoorl  operators  incliiile  noise  smoothing,  edge 
I  rispenme.  hne.ir  edrte  enh.m.  enient.  etc, 

I  he  lollow  mg  'uhseclions  discuss  the  applicahihty  of  the  I’lPl.  I  SIP  to  matrix  operations  for 
i.diul.ilnii'  Ii.iii'loini  coel  I  icien  ts  .md  neielihorhoorl  oireratoi  calculations. 


Input  Output  ( Ousidertitiuiis 


k,  .  Ill  Ilu'  lull  Um'  111  till.  I’ll’l  1  Sl(  Is  .iM  ..luKi  sl.iuil  .U;-  III  lliv  iiipiil  cnilpul  i  ul.il  ]■  iiisin  | 

1  iji.iu  44  shows  tlk-  input  output  pins  ot  ttu  I’ll’l  I  S|(  I  liu  I’AK  SI  K  si  li  .  t  pm  iluiut 

i.uu  s  whi'tliiT  till'  I  SR'  opiTuli's  on  4  >  I  or  4  ■  4  lilniksi.i  .l.il.i  i-itla'i  o|  wln^ii  m.o  In  slulmp 

III  noiisliilin;^.  Ills'  input  svori!  louj^th  is  ilvsi-jn.tti'd  l"i  llii  44111  WORl)  1  I  Ntilll  pit's  1  mluhiit 

p  ii.ilk'l  input  words  S.  1.  or  1’  ;ii'i'  loudcd  into  tlu-  input  I.iIiIks  In  ilu'  INI’l  I  S  I  Ki  )HI  .md 

■  oMM'iti'd  into  hit-scri;il  words  hii  thu  MASII  K  t  I  I )(  k  .md  I  0\l)  Ilu-  I’ll’l  I.SK  is  I  .ip.ihk'  ol 
opm.itiii;'  on  4's  I'oinpk'mcnt  or  sipn  inupnitiidi-  d.it.i  .is  di  ii-imiiiid  In  the  si.iie  oi  ilie  4’s(  ()MI’ 
pm  A  I  ).\  1  .\  \  Al. 1 1)  I'lilso  111 lornis  1  III'  iisi  i  w In  n  1  li .  I’ll’l  I  Sl(  h.is  ^oinpk'ti'd  .i  i  iK  ul.ition.  I  he 
Sil'it  p.ii.illcl  output  is  obtained  In  lirinenip  tiu  eii.d'le  tl\l  imii  low  to  .ulu.ile  the  iri-state 
I  mti'uis. 

I  he  I’lPl.  I.SK  reiiuircs  .i  smele  .'snolt  j'owei  siii'P'i;.  .ind.  diiiinii  proei  am  mine  ot  the 
1  l’K()\l.  .1  45-\olt  proeraminnie  loltape 

•\  total  ol  .mS  pms  is  iet|uiii'd  ioi  UdI  Sdiil  input  and  4()  bit  output  opetatioils.  It  the  input 
wold  lenetli  is  rediued  to  <'  bps  and  onlv  -S  bits  o|  the  output  ate  used,  a  total  of  40  inns  ate 
needed.  1  lie  pins  rei|uired  lot  worddenetli  seleetioii.  input  l\pe  (parallel  or  serial  l.  and  data  rorniat 
id's  ii'mpleinent  or  maenituile  i  eaii  be  einninated  In  on-elnp  boiidine  to  the  \’,,||  or  eri;und  pin 
lor  tiirther  pin  reduition  on  li\ed  .ipplkalion. 
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Fittiire  44.  PIPl-  I.SK  I  ()  Considerations 


1  Ml  »  'I'MI  .11  K  iM.ll  ,  li.lUK  Icl  Istu  ^  i*l  llu-  IMI’l  I  sn  .il\'  i!  l'..- II  111  >t  I.iIlI  ''U  I'S'.'.  I  In  ii  1  Ik's-.' 
.  li.ii.K  u  iinIks  i.Milt  in  I', lit  irniii  tin.- .iKliilL-.liiri.' 'll  iiii|i|Liiicnl  tik- I'll’l  l.SK  .iinlm]i,iil 
1 1  n!  I !  ilk'  .  nk  ml  d  CM  tin  nl  the  .ii  c  lute.  I  me  Alter  ii  iIin.  iK.ion  oi  the  t\  |'es  u!  .  al.  iiial  ini  n  the  PI  I'l 
I  M(  IS  .  a  I’al'le  nt .  these  .  hat  a.  tel  tsl  i.  s  aik  east  I  \  Uliilei  sli  lenl  aikl  aj'l'l  e.  lalikl 

'  \t.ilii\  ()|)L‘iati()ii 


he  PIPI  1  SK  is  ideall\  smieU  Inr  I’latns  oi’eralicnis  of  tile  turiii 

V  s  |u„  w  ,  w  u  .  w,  u.  u„  vv-  I  r.\,' 


el.  "t  I  ei’ies.  Ills  the  ('riHlikl  "I  a  i.i\\  \  e.  t(  >1 .  A  .  aiul  ,i  .  ( iliiiiiii  \  ceti  ir.  ,\  Mnsl'ienah 
i'lu  alimis  reiimie  the  prndikl  ol  a  u eiuhl me  mad i\  aiij  an  in|Hil  \ e.  Ini  I  Ins  can  bn  r 


pi'i  I. ssnie 
.'i'lesenle'.l 


nn  111'.'  PIPI  I  SK  as 


\  -  VV  \ 


vv ,,,  vv,,  vv 


vv  .  .  vv  ^  vv  . 


\  iliaL'iain  n|  Ih  PIPI  I  SK  . .  i;i  I  unit '..I  I.'  iini'l.  ineiil  l'|..ad.vi  I'-sIhiwn  m  lan.ieh' 
Vila  PIPI  I  SK  s  are  nseih  with  ea.Ii  PIPI  ini  iL’r.iinnieil  u  1 1  h  t  he  w  ek'li  I  nii'  .  ne  1 1 1,  leii  I  s  o  I  o  ne  I  nu 
■1  VV  I  Ins  elniiinales  i  epi' ler  nnnnnL’  the  PIPI  I  SK  .  i  Inis  in,  reasii:  •  i!.,  speed  n|  ilu  .alenlalinn 
\li  tile  P  nipnls  lit  the  PIPI  I  SK  s  .ire  .untie,  ted  an.l  data  is  enl.i..!  se.  1 1  e. 1 1,,]  | .  \me  saiiipi. 
i  .  iii'ds  ,ir.'  rei|uired  t<i  Inad  the  PIPI  I  SK  s  I  lie  r.-,  tm  is  .al.aial.d  ni  p  nall.  i  tm  llie  paituul  ii 
.ahi','.  Ill  \  V  \alld  I'lilpnl  IS  a\ailable  e\er>  s.nnpl.' P''d'’'l  d  ' 'P'- i  a  n.e  mi  slid  me  'M' .  1  dal.i  a  liih, 
Mile  s.nnpk-  perinds  ate  ris|uir.'d  d'  I'rndnee  an  aiisAei  il  the  np.ialimi  id  I  .|i,ali.ai  is  pellemied 
'll  nmislidme  d-h\  |  data  him  ks 
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I  lio  ^  i  Mil u'lii  jl io!)  siliiuii  in  I  iL’iun  4o  win  Iv  usoil  to  iiilcuhiU’  lik'I  I  n  K'lil'  ol  \aniuN 
li.iipnn'iiiN 
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F  ieiiri’ .46  Wfiglitini;  Coefficient  Matrices  for  Various  H  /  1  Iransfonns 


1. 


<  2|  +  1  il\7r 

\  c<)>  k  = 
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I  'I .  Hi  I  ii.i  I  i‘i  \  ;  on;i 
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I  lie  discrete  co^inc  iniiislorm  c;m  ho  imploiiK'iilod  Lisino  1  isiuro  45 


I  ho  uoi'olitiipj  ocK’l'tlcioiU  iiKitncos  tor  X4n  I  liad.nnaid.  Ualsh.  and  llaar  traiisl'ornis  aro 
shown  in  f  iuuio  41 .  I  ho  transfonns  Uosoriboil  aho\o  all  liaro  roal  ooolTioioiils. 


rho  I  ounor  traiislonits  (irodiioos  oomplox  ooolTioionls  hooaiiso  oh  a  ooiiiplox  woiirhlino 
matrix,  I  ho  disoroto  I'ourior  translorni  ol  a  soipioiioo  is  doliiuai  In 


\I  I 


k  =  0.  I,  2.  .  ,  .M-l 


.NssLiiiiin.i:  all  X-poini  iransl(,)rni  ani.1  dolinini; 


W  =  0 
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I  (pialioii  i  I  sail  ho  rowritton  as 


I  (I 


till 


1  1  2i 


lid) 


wliioh  IS  iho  saino  I'nrm  as  l■.^^uatioll  5.  I  ho  woiirhtiiii;  ooolTioionts  aro  now  ooinplox.  roquirini.'  adtli- 
tional  I’ll’l  I  Sl(  s  to  oaloulalo  tho  roal  aiul  iiiiapinarx  hourior  ooori'ioionts,  I  his  is  shown  in 
I  iL’iiro  4 7 


SO 


H(z)r  Y(z)=  Ao  -  a,  z"'  f  A2  Z~‘^ 

X(Z)  I  -  B,  Z"’  -  Z~^ 

Yk=  A^  t  a,  X^_,  ♦  A^  X^_2  I  B,  Y,^_,  f  B2  Y,^.^ 


AAA 
0  1  2 


|B,  B^j 

'^K-t 

'^K-2 

PIPE 

PIPE 

a,.  A2) 

(B,.  B2) 

Figure  4‘).  Using  PIPE  LSIC  l«  InipIeineiK  Second -Order  Filter  Function 

1  he  diU’croiicc  equation  is 

'''k  =  ‘‘(i  -'^k  +  ‘‘1  '^k  I  ^  +  '’1  '‘'k  1  ^  '•'k  : 


V,  =  la,,  a,  a,  b,  b,  | 


I  (|nati(in  17  is  llie  same  lonn  as  I  (lualioii  6  and  ean  be  implemented  using  PIPl  l.SlUs  as 
■-liown  in  I  igure  4‘)  I  lie  X  ilata  secjiienee  is  loadetl  sei|uentiail\  into  the  I'irst  IMl’l  LSIC  containing 
sveigluing  eoelTieients  a,,-  a,  .  and  a-,  [lie  ouI|HiI  oI  the  I'irst  PlPI  1  SR  is  loaded  sequentialK  into 
the  second  IMl’l  L  SIC  to  eomirlete  the  calculation  ol'  I  quation  1  7. 

4  Neighborhood  Operators 

Mans  image-processing  algorithms  such  as  noise  cleaning,  eilge  detection,  and  edge  enhanee- 
meiit  ean  be  implemented  with  the  I’ll’l  I  SIC  operating  on  sliding.'  <  pi\el  blocks  of  the  input 
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Figure  SO  Two-Dimensional  Spatial  Convolution  for  3  X  3  Neighborhood 


image.  A  3  X  3-iiiput  array  is  spatially  convolved  with  a  two-tlimensional  weighting  arra\ .  as  shown 
in  I-'igiire  50.  This  5  .<  3  two-dimensional  spatial  convolution  is  ;i  \er\  powcrliil  image-p'■occ.^.^ing 
calculation  and  one  for  which  the  PIPF.  LSIC  is  ideally  suited. 

riie  following  subsections  discuss  some  commonK  used  neighborhood  operators  .ind  present 
some  experimental  results  of  tlie  programmable  sum  of  jirotiucts  hreatiboard  tlcscrihed  in  .Subsection 
11. A. 7. 

a.  \<)ise  ( leaning 

Main  images  contain  discrete  pixel  variations  that  are  a  result  of  noisy  sensors  and  \er\ 
objectional  from  a  user  viewpoint.  Simple  low-pass  spatial  filtering  can  eliminate  or  smooth  imrst 
such  noi>e  MiKc  the  noise  is  decorrelatcxl  spatialls  from  its  surrounding  pixels.  Figure  51  slu.iws 
these  low-pass  weighting  arra>s  that  can  be  used  as  weighting  coefficients  for  the  PlPl.  l.SK  .  As 
can  be  seen,  the  weighting  arraxs  arc  normali/cd  to  unit  weighting  to  present  an  inlensitx  bias 
into  the  processed  image, 

h.  I'nhanccincnl 

I  ilge  ei  hancement  is  used  to  accent  edges  ol  an  image  to  prox  ide  a  more  subjectixelx  pleasing 
image  Since  areas  of  high  frec|uency  (edgesi  arc  to  be  highlighted,  the  logical  operation  is  a  high- 
[Xiss  filter  I  Ins  can  be  done  spatially,  using  the  PIPl.  l.SK  as  a  3  X  3  neighborhood  operator.  Some 
weighting  arraxs  that  are  of  the  high-pass  form  arc  shoxvn  in  I  igure  52.  1  here  is  no  need  to  nor- 
mah/e  these  .irraxs  since  their  'Imiients  sum  to  units. 

(•  /.  (/gc  Detectors 

( )nc  ol  the  most  distinguishing  features  ol  an  image  arc  edges  because  thex  |irox  ide  informa¬ 
tion  on  the  phxsival  extent  of  objects  within  the  image  F.dges  are  defined  as  local  discontinuities 
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Figure  51.  Low-I’ass  Weighting  .Arrays 


Figure  52.  High-Pass  Weighting  Arrays 


III  ilir  imago  liiininanoo  or  amplitikio  love),  f  Jioro  are  ha.sioullv  Usar  motlioJs  (if  oelgo  Liotoolion  oiigo 
.  iili.iiKomont  t'ollovvod  In  tlirosholsliiig  aiul  oilgo  filloring. ' 

In  llio  odgo  onhaiKomont  llirosholding  niolhoii,  tho  input  imago  is  spatialh  ooiuoKod  vsith  a 
■i  111  imoar  weighting  arra\s  to  pioduoo  a  sot  of  gradiont  I'niKtions  \s  luoh  aro.  in  turn,  somhinod 
i’\  .1  Imoar  or  nonlinoar  fimotion  to  oroato  an  cdgo-onhanood  array,  lo  improso  odgo  \isibilit\.  tho 
■i.i'  K.\ol  map  IS  oomparod  to  a  threshold.  I  ;  if  tho  gray  level  is  greater  than  I  .  an  edge  is  assiii.ioil 
p’.'i.  Nl  it  tho  gray  level  is  less  than  I.  the  ileeisiim  is  no  edge.  I  he  seleetion  ol  the  tliroshold  is 
V.  I ;.  iiiiportaiit .  if  it  IS  too  high,  some  edges  w  ill  not  be  deteoted;  if  it  is  too  low.  noise  w  ill  bo 
ill  I  s  led  as  eilgos. 

I  here  are  two  ty  pes  of  edge-enhaneeinent  operators:  (.liffereiitial  erige  detostors.  siuh  as 
Kolierts,  Prewitt,  and  Sobol,  and  teniplate-matehing  edge-deteotion  such  as  eoinpass  gradient. 
Kirs,  h  three-level,  and  fi\e-level. 


In  edge  fitting  edge-deteetors.  subregions  of  the  input  image  are  fitted  to  a  two-ilimensional 
miidol  ol  an  edge  If  the  tit  is  elose.  an  eilge  is  assumerl  to  exist  with  the  same  parameters  .is  tho 
d.'o  model  I  dge  fitting  eannot  be  implemented  with  the  PIPI  I  SK  .md  is  not  disoussed  fiirthei 
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Fillin'  .■'.V  Weighting  Arrays  for  Various  Differential  tdge  Detectors 


d.  l)i})cn’iiii(d  !  dfic  Detedors 

I  he  Kobcrls  edge  deleetor  is  ai'plied  to  a  2  x  2  neigliisorhood  of  pixels  and  can  he  imple- 
nientetl  as  two  spatial  eonvokitions  of  the  in|nit  array  with  a  weighting  arrax ,  I  he  outputs  of  ilie 
two  coil'. oliitions  are  conihtned  to  protluee  an  edge  magnitude  to  he  coiiipareti  with  the  selected 
etige  threshold,  f  ,  I’lie  orieiitattoii  of  the  edge  can  also  be  calculated  Iroiii  the  outputs  eif  the  two 
coinoliitioiis. 

I  he  Sohcl  aiul  I’rewitt  edge  operators  tire  applietl  to  d  X  w  indows  of  jiixels  and  are  miple- 
meiited  In  two  p.iiial  c  uv.olutions  of  the  input  imtige  with  a  X  weighting  arrax.  .Xgain.  the 
inagmiude  and  orientation  of  the  edge  can  he  calculated  Iroin  the  two  coinolutions. 

.\notlier  dilTeiential  edge  detection  is  the  l.aplacian  operator,  howexer.  because  of  its  seii- 
sitixitx  to  points  and  lines,  it  is  not  a  xerx  eiricient  edge  detector.'  ' 

I  he  weighting  tiirtixs  used  lor  the  Roberts.  Sobel.  and  I’rewitt  edge  detectors  are  shown  in 
1  igurc  5.V  I  or  each  ofierator.  the  aiiiplitiule  ol  the  eilge  is  given  bx 

,.\ii..ii  -  1 1  'i  "  (i.jll  •  +  I't  '''  n.j  1 1  -  f  '  I  I  M 


OI 


A(  i.j  I  =  n  "  I  i.j  h  t  A  '  1  i.j  i; 


where  't"<i.|i  is  tile  eoinolntion  ol  the  ini'Ut  arrax  and  the  liori/'ontal  xxeigliting  arrax  and  ^  '  ii.ji 
is  the  sonxolulion  of  in|nil  arrax  and  the  vertical  weighting  arrax 


INPUT 


J  igurc  54  liiiplemeiitation  of  Differential  Edge  Detectors  Using  PIPE  LSIC 


I  nr  the  Roherts  operator,  the  cilgc  orientation  is  given  b\ 


(Mi.jl  -- 


TT 
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+  tan  ' 


t  or  the  Sobe1  and  I'rewitt  operator,  tile  edge  orientation  is  given  b\ 


(  UU 


i.j  1  =  Ian 


( :o  I 


I  he  I’ll’l  l.SK  ean  be  used  to  implement  the  Roberts.  I’rewitt.  or  the  Sobel  edge  detector.  .\ 
blo^k  diagram  ot  the  implementation  of  these  operators  is  shown  in  bigure  .''4,  I  he  input  data  is 
loaded  as  three  sets  ol  three  I'arallel  words,  [hree  sample  peiiods  are  reipiired  to  mitialK  load  the 
I’ll’l  l.SK  and  iM'ovivie  the  lirst  outinil.  Sueeeeding  outputs  oe^ur  each  sample  period.  1  or  the 
Roberts  operator,  tlie  weighting  eoeliieients  wi.iukl  be  rearrangeil  to  take  into  aeeount  the  Z  a  I 
weighting  arra\ 


e  h'lnplalc  Matching  /u/ge  Detectors 

In  templale-matt  lung  edge  delestion.  a  set  ol  weiglilmg  arravs  corresponding  to  the  eight 
iiiaior  compass  ilireelions  (north,  northeast,  east,  ete.i  is  eoinohed  with  the  input  image.  I  he  edge 
orientation  is  determined  b>  tile  direction  produeing  the  maxiiiuim  gradient  response  greater  tiian 
the  selected  liireshold.  1  samples  of  lemplate-matehing  weighting  arraxs  are  given  in  I  igure  55  tor 
the  compass-gr.i(.lient.  Kirseh.  three-le'c'l.  aiul  li\e-le\el  template-matching  operators. 


I  he  I’ll’l  I  SK  can 

I  iL'iiic'  'I'  I  he'  input  arr.p 


implement  the  tempi. ite-matc lung  edge  detectoi  e 
is  lo.uled  as  three  sets  ol  three  I'ar.illel  wools  m  .ill 


,isil\ .  as  show  n  in 
eielil  I’ll’l  l.SK  s 
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Figure  56.  Iniplenieiitution  of  leinplate  Match  Edge  Deteclors  U'-ing  I’llM.  LSU 


I  Ik  oiitputs  arc  compaicd  it)  ilclcrminc  llic  maxinuiin  response  and.  hciisc.  ilic  oiicnlalmn  ol  iIk' 
s'iIl’C 

/  I'.d^e  Detector  Ferfonnanee  Analysis 

In  an  attempt  to  deterndne  tlic  ivlalive  peirormaneo  of  the  edge  ileteetors  discussed  alH>\e  .m 
evaluation  has  been  perl’ormed  eom|saring  edge  response  as  a  fnnetion  of  actual  edgk'  oi-ieniaiuin 
i  lu  probability  of  correct  detection  as  a  function  of  the  probability  of  false  detect  ion.  and  a  I  ig  ui  e 
ol  merit  as  a  function  of  signal-to-noise  ratio  ' 

I  igure  5^  shows  edge  deteUor  resi'onse  lamplitude  and  orientalioni  as  a  IuikIiou  o|  .elu.ii 
edge  orientation.''  I'or  the  Koberls,  Preveitt.  and  Sobel  amplitude  lespi.mse.  both  the  sipi.iie  looi 
sum  of  the  squares  and  the  sum  ol  the  absiilule  values  1 1  (piation  1  ■'s  i  responses  are  show  n  Assail  lx 
wen.  the  Prewitt.  Sobel.  and  template  matihing  ami'liliuie  response  is  relativelv  invariant  to  edge 
orientation  while  the  Sobel  operator  has  the  most  linear  response  between  attii.d  eilge  in  leiital  loii 
.iiul  deteeted  etlge  orientation  I'or  the  tempialeaiiatshing  operalois.  the  vhllereiKc  between  aUual 
and  detected  edge  orientation  is  large  besaiise  the  tempiale-matshing  operators  measure  edge 
orientation  m  a  quanti/ed  step. 

I  he  performance  ol  various  evige  detectors  ni  the  preseiKe  of  aildilive.  white  (laussian  noise 
i.in  be  com[vared  using  parametru  curves  of  correcteil  detection  probabilitv  versus  false  dclcetion 
pn'Sibihtv  in  terms  of  the  detevlor  threshold'  1  igure  .‘'cS  shows  sir  h  siirves  |oi  veriieal  and 
li.e.'oiial  edges  (or  both  difterential  edge  ileleclors  and  lemplate-maK  lung  diUs  tois  with  sign.ii  lo 
"lOist  latios  of  I  t)  ainl  10.0  (  or  the  diileretilial  iletevtors.  the  Sobel  and  Prewitt  pertonn  IhIUi 
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^iire  57.  Edj’e  C/nidieiU  Anjpldiide  Re.spoi»>e  and  I>efec(ed-Ed^c  Orioutalion  as  Eniutions  of  Acttial  EdtiL*  OrieiKalion  (I  rom  Reference  6> 
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iftL-  i.nni|).iss  liiiiclicnt.  the  Kirseh.  the  three-le\el  .leu  -lli  I  he  ^  m  h.i.iee  ,iie  shM\\n 

111  1  iL'U le  fi 

6  ()per.'iti(iii;il  (  liaraeteristics 

Se\Li.il  ke>  parameters  deline  the  Dpei  iie  e  ,  iiu  I'll’i  l^h  lii; 

1  i|i  eral  ii  III''  (serial  or  parallel.  shdiiiL’  oi  ni  iidiiiie.  i  i-;  :'  wmd  ii'  hil'  ''  I'll'.  ele  i 

I  l’R(  )\1  a L '  es' t line  ;  at  mimiilal or  I  ime  ,  and  I  he  I  i •  'i  . ■  '  e  ;  1 1  ,i l  lal  i le' 

S  nil.  e  t  he  PI  I’l  I  .Sl( '  must  ai.  eess  I  he  ( 111  !  n  |  .  .  ■■  ■■  h.  : , :  e .  '  i  m.  In  1 1  H  o  I  lu  lei  i.el  1 1  '  1 1  I !  u 

ini'iil  'vi.oidi  to  lakniate  one  term  (d  the  an-e.-.i  i  :  •:  !■  ■  .  .d.ihir  inie.l  'iim  lh"'e  leini'  llu 

lime  le'inired  to  output  an  aii'wer  Irom  the  PIPI  I  M' 
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PREWITT 


Figure  61.  Sobel  and  Prewitt  Edge  Detection  Using  ProgrannnaWe  Sum-of-Products  Breadboard 


T  ,  =  .\1A.\(  I  ,  >  li  .  I  I 

where  f  is  the  on-eliip  hPROM  access  time,  aiul  I.,,,,,;  is  the  time  recjuired  Iss  the  accmmihitor 
fo  prevent  addressing  the  memory  incorrectls.  the  time  between  load  pulses  must  also  satisis 
laiuation  22. 

Recalling  that  the  number  ol  injnil  strobe  pulses  between  load  pulses  determines  vvhether  a 
sliding  or  nonslicling  type  of  operation  is  performed,  the  time  between  strobe  pulses  t  f^  ^  i  can 

be  given  as 


*  SI  Koto  '  I  O  \0  'lAMtl' 
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l  igure  62.  Template  Match  Kilge  Detection  Using  I’rogrammahle  Sum  of  Products  Breadboard 


w1k'I\'  N  is  llic  number  ol  input  strobes  between  lo.iil  pulses.  I  |  i'  tlie  tune  lietween  load  pulses 

as  delennined  In  l.riuation  22.  and  li  is  the  operating  time  lor  the  input  lalelies 

lo  maintain  proper  on-elliii  timiina.  the  master  eloek  <  I,  i  must  be  .ereater  tlian 

times  the  loail  rrequene>'.  or 


l  able  3  summari/cs  b.ciuations  22  throuiih  24  m  terms  ol  mpiil  and  oiitpLit  data  rates  lor 
difterent  tv  pes  of  operations,  word  length,  and  hl'KO.M  aeeess  lime  (assumme  .'()()-ns  aeeumulator 
timei.  20-Mtl/  input  lateh.  and  20-Mll/.  master  eloek  (design  goals). 

In  the  serial  IS  .■  I  sliding  l>  pe  ol' ojieration.  one  strobe  pulse  is  neerled  bet\^een  e.!.-i.  load 
pidse.  and  the  load  t'rei]ueney  is  ealeulaterl  Irom  Li|uation  22  to  be  I  2.s  .\lll/  loi'  ,s-bit  data  and 
I.io  \lll/  I'or  ('-bit  ilala,  assuming  l()()-ns  l.l’KOM  aeeess  time.  I  i)r  seiial  N  I  nonsikling  tvpeoi 
operations,  eight  strobe  pulses  are  reiiuired  betwee  i  each  load  puKe:  therelore.  the  ma\imum  load 
irequeiKA  is  1 .2.s  .Mil/.;  howe'er,  sinee  ilata  ean  be  loaded  in  the  I’ll’l  input  latehes  independent 
ol  the  parallel-to-serial  registers,  the  masimuin  input  ilata  rate  is  deteriniiied  bi  the  input  stroi'e 
I'requene'.  I  his  ean  be  ealeulated  I'nnn  I  iiuation  2.'  to  iie  Id  Mil/  lor  both  b-  and  ('-bit  daia  loi 
b  ■  I  nonslieling  t\ pe  ol  operations. 

lor  the  parallel  .S  •  .'  shiling  t).pe  ol  operation.  onl\  one  strobe  pulse  is  needed  belweeii 
load,  thus,  rroiii  I  iptations  22  and  2.'.  the  load  Irequeiie'  and  mpul  data  rate  i  input  strobe 
i|uene\  I  ean  be  ealeulated  lo  be  l.2.s  .\lll/  lor  K-bii  ihila  and  I. '>2  Mil/  for  (obit  dal.i.  with  ,iii 
f.l’KtiM  aeeess  time  of  I  dd  ns,  bor  parallel  .'  ■,  iions/idKig  I\ /)e  ol  Operations.  lliree  strobe  jndses 
are  required  between  load  pulses;  therelore.  the  mpul  data  rate  is  Id  .Mil/  lor  both  b-  and  o-bit 
data.  I  he  el  leel  ol  l.l’KOM  aeeess  time  on  mpul  data  rate  is  also  sliow  n  in  I  able  .v 

I  he  output  data  rate  is  given  b\  l,(|uaiion  22.  regardless  ol  the  I\pe  ol  operation,  and  is 
1 .25  ,M  1 1/  lor  S-bit  data  and  I  (w  .Ml  1/  for  ('-bit  data  w  ith  1  dO-iis  l.l’KOM  aeeess  time 

lo  aehie'e  real-time  ilO-.MIl/i  operation  on  S-bil  dala  with  5d-ns  I  I’KO.M  anessiime,  lour 
Pll’l  I  Sl(  s  lan  lie  operated  in  parallel  as  show  ii  in  I  igure  o.'  for  iransfomi  eaLiilations  and  1  igure 
('4  tor  neighborhood  operators.  In  both  iinplemenlalions.  the  input  ilata  is  demiilliplesed  1\\  the 
input  strobe  pulse  ol  the  I’ll’l ,  I  Sl(  into  the  lour  paiallel  I’ll’l  s  and  the  tri-stale  outputs  aie  nuilli- 
I'lexed.  using  the  enable  pulse  of  eaeh  I’ll’l  LSK  .  .X  block  diagram  o!  the  I’ll’l  L  Sl(  Oeimmstia- 
tion  Brassboaril  is  shown  m  l  igure  (>5. 
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